CHAPTER 11 Comparing Average Values between Groups 145

Like the t test, the ANOVA also assumes that the value you are comparing follows

a normal distribution, and that the SDs of the groups you are comparing are sim-

ilar. If your data are not normally distributed, you can use the nonparametric

Kruskal-Wallis test instead of the one-way ANOVA, which we demonstrate later in

the section “Running nonparametric tests.”

Adjusting for a confounding variable

when comparing means

Sometimes you are aware the variable you are comparing, such as reduction in

blood pressure, is influenced by not only a treatment approach (such as drug A

compared to drug B), but also by other confounding variables (such as age,

whether the patient has diabetes, whether the patient smokes tobacco, and so on).

These confounders are considered nuisance variables because they have a known

impact on the outcome, and may be more prevalent in some groups than others.

If a large proportion of the group on drug A were over age 65, and only a small

proportion of those on drug B were over age 65, older age would have an influence

on the outcome that would not be attributable to the drug. Such a situation

would  be confounded by age. (See Chapter  20 for a comprehensive review of

confounding.)

When you are comparing means between groups, you are doing a bivariate com-

parison, meaning you are only involving two variables: the group variable and the

outcome. Adjusting for confounding must be done through a multivariate analysis

using regression.

Comparing means from sets

of matched numbers

Often when biostatisticians consider comparing means between two or more

groups, they are thinking of independent samples of data. When dealing with study

participants, independent samples means that the data you are comparing come

from different groups of participants who are not connected to each other statisti-

cally or literally. But in some scenarios, your intention is to compare means from

matched data, meaning some sort of pairing exists in the data. Here are some

common examples of matched data:»

» The values come from the same participants, but at two or more different

times, such as before and after some kind of treatment, intervention, or event.»

» The values come from a crossover clinical trial, in which the same participant

receives two or more treatments at two or more consecutive phases of the trial.